On the huge benefit of quasi-random mutations for multimodal optimization with application to grid-based tuning of neurocontrollers

Authors

  • Guillaume Chaslot
  • Jean-Baptiste Hoock
  • Fabien Teytaud
  • Olivier Teytaud
Abstract

In this paper, we study the optimization of a neural network used to control a Monte-Carlo Tree Search / Upper Confidence Trees (MCTS/UCT) algorithm. The main results are: (i) the specification of a new multimodal benchmark function, defined in particular in agreement with [1], which pointed out that most multimodal functions are not satisfactory for some real-world multimodal scenarios (section 2); (ii) experiments with Evolution Strategies (ES) on this new multimodal benchmark function, showing the great efficiency of quasi-random mutations in this framework (section 3); (iii) a proof of concept of the application of ES to the grid-based tuning of neural networks for controlling MCTS/UCT (see section 3).
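As a rough illustration of what "quasi-random mutations" can mean in an Evolution Strategy, the sketch below replaces pseudo-random Gaussian mutation steps with points of a Halton sequence mapped through the inverse normal CDF. It is not the authors' implementation: the unscrambled Halton sequence, the Rastrigin test function (a stand-in for the paper's benchmark, which is not given here), the geometric step-size decay, and all names and parameter values are assumptions chosen only to keep the example short.

```python
import math
from statistics import NormalDist

# First primes, one base per coordinate (assumption: dimension <= 10).
PRIMES = [2, 3, 5, 7, 11, 13, 17, 19, 23, 29]


def radical_inverse(index, base):
    """Van der Corput radical inverse of `index` in `base`, a value in [0, 1)."""
    result, fraction = 0.0, 1.0 / base
    while index > 0:
        index, digit = divmod(index, base)
        result += digit * fraction
        fraction /= base
    return result


def halton_point(index, dim):
    """Point number `index` of the (unscrambled) Halton sequence in `dim` dimensions."""
    return [radical_inverse(index, PRIMES[d]) for d in range(dim)]


def rastrigin(x):
    """Standard multimodal test function; a placeholder, not the paper's benchmark."""
    return 10 * len(x) + sum(xi * xi - 10 * math.cos(2 * math.pi * xi) for xi in x)


def quasi_random_es(objective, start, generations=40, lam=12, mu=3,
                    sigma=1.0, decay=0.9):
    """(mu/mu, lambda)-style ES whose mutation steps come from a Halton sequence."""
    dim = len(start)
    normal = NormalDist()
    parent = list(start)
    best_x, best_f = list(start), objective(start)
    index = 1  # start at 1 so every coordinate lies strictly inside (0, 1)
    for _ in range(generations):
        offspring = []
        for _ in range(lam):
            # Quasi-random Gaussian step: Halton point -> inverse normal CDF.
            step = [normal.inv_cdf(u) for u in halton_point(index, dim)]
            index += 1
            child = [p + sigma * s for p, s in zip(parent, step)]
            offspring.append((objective(child), child))
        offspring.sort(key=lambda pair: pair[0])
        elite = [child for _, child in offspring[:mu]]
        # Recombine: the new parent is the mean of the mu best offspring.
        parent = [sum(c[d] for c in elite) / mu for d in range(dim)]
        if offspring[0][0] < best_f:
            best_f, best_x = offspring[0]
        sigma *= decay  # simple geometric decay instead of self-adaptation
    return best_x, best_f


start = [2.5, -1.5, 3.0]
best_x, best_f = quasi_random_es(rastrigin, start)
print("start fitness:", rastrigin(start))
print("best fitness found:", best_f)
```

Because the Halton sequence is deterministic, repeated runs give identical results; implementations that need independent repetitions typically use a scrambled or randomly shifted sequence instead.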


Similar resources

Fuzzy particle swarm optimization with nearest-better neighborhood for multimodal optimization

In the last decades, many efforts have been made to solve multimodal optimization problems using Particle Swarm Optimization (PSO). To produce good results, these PSO algorithms need to specify some niching parameters to define the local neighborhood. In this paper, our motivation is to propose the novel neighborhood structures that remove undesirable niching parameters without sacrificing perf...


Preliminary Design of Spacecraft Attitude Control with Pulse-Width Pulse-Frequency Modulator for Rest-to-Rest Maneuvers

In this paper, the preferred region of design parameters for quasi-normalized equations of single-axis attitude control of rigid spacecraft using pulse-width pulse-frequency modulator (PWPFM) is presented for rest-to-rest maneuvers. Using the quasi-normalized equations for attitude control reduces the system parameters, that is, the moment of inertia, the filter gain, and the maximum torque of ...


Optimizing Design of Stand-alone Hybrid Solar Micro-CHP Systems Using LUS Based Particle Swarm Optimization Algorithm

Utilizing the combined cooling, heating and power generation (CHP) systems to produce cooling, heat and electricity is growing rapidly due to their high efficiency and low emissions in commercial and industrial applications. In conventional CHP systems the deficit of the system power can be purchased from the grid. However, this system cannot be used as the standalone application. The hybrid so...


Investigation on Equivalent Trans-utilization Mode and Benefit of Wind Energy

For economic benefit of wind power generation, the equivalent conversion relationships and models between the different “quality” energy are studied deeply in the conversion processes of wind energy. Considering the effect of load demand characteristics and energy supply price on the wind energy utilization mode comprehensively, the multi-objective trans-utilization optimization model of wind e...


Distributed Simulation-based Optimization for Resource Allocation of Multimodal Operation System on Container Terminals

Faculty of Symbiotic Systems Science, Fukushima University, Fukushima City, 960-1296, Japan. This paper focuses on efficiency of simulation-based optimization for container terminal operation system. Firstly, the discrete-event simulation model of multimodal operation system on container terminal was presented with the queuing network, for which the mathematical optimization model of resource al...



Publication date: 2009